Abstract

Separable codes were defined by Cheng and Miao in 2011, motivated
by applications to the identification of pirates in a multimedia
setting. Combinatorially, t-separable codes lie somewhere between
t-frameproof and (t - 1)-frameproof codes: all t-frameproof codes are
t-separable, and all t-separable codes are (t - 1)-frameproof. Results
for frameproof codes show that (when q is large) there are q-ary
t-separable codes of length n with approximately $q^{\lceil n/t\rceil}$
codewords, and that no q-ary t-separable codes of length n can have
more than approximately $q^{\lceil n/(t-1)\rceil}$ codewords.

The paper provides improved probabilistic existence results for
t-separable codes when $t \ge 3$. More precisely, for all $t \ge 3$ and
all $n \ge 3$, there exists a constant $\kappa$ (depending only on t and
n) such that there exists a q-ary t-separable code of length n with at
least $\kappa q^{n/(t-1)}$ codewords for all sufficiently large integers
q. This shows, in particular, that the upper bound (derived from the
bound on (t - 1)-frameproof codes) on the number of codewords in a
t-separable code is realistic.

The results above are more surprising after examining the situation
when $t = 2$. Results due to Gao and Ge show that a q-ary 2-separable
code of length n can contain at most
$\tfrac{3}{2} q^{2\lceil n/3\rceil} - \tfrac{1}{2} q^{\lceil n/3\rceil}$
codewords, and that codes with at least $\kappa q^{2n/3}$ codewords
exist. Thus optimal 2-separable codes behave neither like 2-frameproof
nor 1-frameproof codes.

The paper also observes that the bound of Gao and Ge can be 
strengthened to show that the number of codewords of a q -ary 2- 
separable code of length n is at most 

$$q^{\lceil 2n/3\rceil} + \tfrac{1}{2} q^{\lfloor n/3\rfloor}\bigl(q^{\lfloor n/3\rfloor} - 1\bigr).$$

Key words: Separable codes, probabilistic constructions, multimedia
watermarking


1 Introduction 

Let F be a finite set of size q, where $q \ge 2$. Let n be an integer,
where $n \ge 2$. For a subset $X \subseteq F^n$ of words over F of
length n, and for $k \in \{1, 2, \ldots, n\}$, we define X(k) to be the
set of kth components of words in X, and we define the set desc(X) of
descendants of X by

$$\mathrm{desc}(X) = X(1) \times X(2) \times \cdots \times X(n).$$

For example, if X = {0000, 0111, 0012} then X(1) = {0}, X(2) = X(3) =
{0, 1}, X(4) = {0, 1, 2} and desc(X) is the following set of words:

{0000, 0100, 0010, 0110, 0001, 0101, 0011, 0111, 0002, 0102, 0012, 0112}.

Note that $X \subseteq \mathrm{desc}(X)$.
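The descendant set is a Cartesian product of the component sets, so it can be computed directly. The following is a minimal sketch, assuming words are represented as Python strings of equal length; the function name `desc` mirrors the notation of the text but is otherwise an illustrative choice.

```python
from itertools import product

def desc(words):
    """Descendant set of a collection of equal-length words (strings)."""
    words = list(words)
    n = len(words[0])
    # X(k): the set of k-th components of the words in X
    columns = [{w[k] for w in words} for k in range(n)]
    # desc(X) = X(1) x X(2) x ... x X(n)
    return {"".join(p) for p in product(*columns)}

X = {"0000", "0111", "0012"}
D = desc(X)
print(len(D))   # 12
print(X <= D)   # True
```

The size of desc(X) is the product of the sizes of the component sets, here 1 * 2 * 2 * 3 = 12.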

The notion of a descendant set arose in connection with watermarked
multimedia content. In these applications, each user is given a copy of
the data watermarked in a different way: these watermarks correspond to
a q-ary word of length n. The watermarks corresponding to users are
chosen to lie in some carefully constructed code C. An adversary
receives t copies with different watermarkings (from t compromised
users), the watermarks corresponding to a set X of t codewords. The
adversary then attempts to construct a new copy of the data. In certain
well-motivated models, an adversary can create data with any watermark
taken from desc(X). Various security models and applications have been
considered; different classes of codes are defined, depending on the
goals of the adversary. See Blackburn [2] for a survey of early results
in the area; see, for example, Blackburn, Etzion, Stinson and
Zaverucha [3] for more recent work on generalisations. As an example of
the combinatorial objects that have been considered in this context,
t-frameproof codes arose from a paper of Boneh and Shaw [4] that
addresses the problem of preventing an adversary from framing an
innocent user:

Definition 1. Let t be a positive integer. A code $C \subseteq F^n$ is
t-frameproof if for all $X \subseteq C$ with $|X| \le t$, we have
$\mathrm{desc}(X) \cap C = X$.

If users’ watermarks are taken from a t-frameproof code C, an adversary
who obtains copies of the data from t users is unable to construct a
watermarked copy of that data that would have been given to another
user.
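For small codes, Definition 1 can be checked by brute force. The sketch below is illustrative only: it assumes codewords are strings, restricts attention to non-empty subsets X (the empty set satisfies the condition trivially), and the two example codes are hypothetical.

```python
from itertools import combinations, product

def desc(words):
    """Descendant set of a non-empty collection of equal-length strings."""
    words = list(words)
    cols = [{w[k] for w in words} for k in range(len(words[0]))]
    return {"".join(p) for p in product(*cols)}

def is_frameproof(code, t):
    """Check desc(X) ∩ C = X for every non-empty X ⊆ C with |X| <= t."""
    code = set(code)
    for size in range(1, min(t, len(code)) + 1):
        for X in combinations(sorted(code), size):
            if desc(X) & code != set(X):
                return False
    return True

print(is_frameproof({"00", "11", "22"}, 2))  # True
print(is_frameproof({"00", "01", "11"}, 2))  # False: 01 lies in desc({00, 11})
```

The search is exponential in t, so it is only practical for toy parameters.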

More recently, models have been considered for an ‘averaging attack’
adversary; see Trappe, Wu, Wang and Liu [13]. Here desc(X), rather than
an element of desc(X), may be extracted from the data produced by the
adversary. Cheng and Miao [8] introduced separable codes in the context
of this model:

Definition 2. A code $C \subseteq F^n$ is a t-separable code if for all
distinct $X, X' \subseteq C$ with $|X| \le t$ and $|X'| \le t$, we have
$\mathrm{desc}(X) \ne \mathrm{desc}(X')$.
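Definition 2 can also be tested exhaustively on small codes: compute desc(X) for every non-empty subset X of size at most t and look for a repeated descendant set. This is a hedged sketch with hypothetical example codes, assuming codewords are strings.

```python
from itertools import combinations, product

def desc(words):
    """Descendant set, returned as a frozenset so it can be stored in a set."""
    words = list(words)
    cols = [{w[k] for w in words} for k in range(len(words[0]))]
    return frozenset("".join(p) for p in product(*cols))

def is_separable(code, t):
    """True if distinct non-empty subsets of size <= t have distinct desc."""
    code = sorted(set(code))
    seen = set()
    for size in range(1, min(t, len(code)) + 1):
        for X in combinations(code, size):
            d = desc(X)
            if d in seen:
                return False
            seen.add(d)
    return True

print(is_separable({"00", "01", "10"}, 2))        # True
print(is_separable({"00", "01", "10", "11"}, 2))  # False
```

In the second code, {00, 11} and {01, 10} have the same descendant set, namely all four binary words of length 2.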

Cheng and Miao [8] pointed out that there is a close relationship
between t-separable codes and frameproof codes:

Theorem 1. [8, Lemmas 4.5 and 4.6] Let t be an integer, $t \ge 2$. Any
t-frameproof code is a t-separable code. Any t-separable code is a
(t - 1)-frameproof code.

Proof. Suppose C is not a t-separable code. Let X and X' be distinct
subsets of C such that $|X| \le t$, $|X'| \le t$ and desc(X) = desc(X').
By interchanging X and X' if necessary, we may assume that
$X' \not\subseteq X$. Clearly $X \subseteq \mathrm{desc}(X)$ and
$X' \subseteq \mathrm{desc}(X') = \mathrm{desc}(X)$, and so
$X \cup X' \subseteq \mathrm{desc}(X) \cap C$. The set X shows that C is
not t-frameproof: desc(X) contains a non-empty set $X' \setminus X$ of
codewords not in X. So the first statement of the theorem follows.

Suppose C is a code that is not (t - 1)-frameproof. So there exists
$X \subseteq C$ and $c \in C \setminus X$ with $|X| \le t - 1$ and
$c \in \mathrm{desc}(X)$. Define $X' = X \cup \{c\}$. Then the sets X and
X' show that C is not t-separable, since desc(X) = desc(X'). So the
theorem follows. □
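The two implications of Theorem 1 can be sanity-checked by exhaustive search over all non-empty binary codes of length 3. The sketch below is an illustration rather than part of the proof; it assumes codewords are strings and considers only non-empty subsets X, and the helper names are hypothetical.

```python
from itertools import combinations, product

def desc(words):
    cols = [{w[k] for w in words} for k in range(len(words[0]))]
    return frozenset("".join(p) for p in product(*cols))

def small_subsets(code, t):
    """All non-empty subsets of the code of size at most t (as tuples)."""
    for size in range(1, min(t, len(code)) + 1):
        yield from combinations(code, size)

def is_frameproof(code, t):
    cs = set(code)
    return all(desc(X) & cs == set(X) for X in small_subsets(code, t))

def is_separable(code, t):
    ds = [desc(X) for X in small_subsets(code, t)]
    return len(ds) == len(set(ds))

words = ["".join(w) for w in product("01", repeat=3)]
violations = 0
for m in range(1, len(words) + 1):
    for code in combinations(words, m):
        for t in (2, 3):
            fp_t = is_frameproof(code, t)
            sep_t = is_separable(code, t)
            fp_prev = is_frameproof(code, t - 1)
            # t-frameproof => t-separable => (t-1)-frameproof
            if (fp_t and not sep_t) or (sep_t and not fp_prev):
                violations += 1
print(violations)  # 0
```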

Cheng and Miao also relate t-separable codes to matrices arising in
combinatorial group testing, and apply these codes to generalisations
of the structures used by Trappe et al. in their multimedia application.

Several other papers have considered t-separable codes. Cheng, Ji and
Miao [7] provide a detailed analysis of 2-separable codes of lengths 2
and 3, finding optimal codes in many cases. Cheng, Fu, Jiang, Lo and
Miao [5] improve the length 2 results using graph theory and packings
derived from projective planes. Gao and Ge [9] provide probabilistic
constructions of, and upper bounds for the number of codewords in,
2-separable codes (which we discuss further below). Finally, various
related codes have been studied: secure separable codes [10] and
multimedia IPP codes.

Consider a q-ary t-separable code C of length n containing as large a
number of codewords as possible. Bounds [1] on frameproof codes, and
Theorem 1 above, imply that there exists a constant $\kappa$, depending
only on n and t, such that

$$\kappa q^{\lceil n/t\rceil} \le |C| \le (t-1)\, q^{\lceil n/(t-1)\rceil} \qquad (1)$$

for all sufficiently large q. If the true optimal value of |C| is close
to the lower bound, we can think of t-separable codes as being close to
t-frameproof codes; if the optimal value of |C| is close to the upper
bound, we can think of t-separable codes as being close to
(t - 1)-frameproof codes. This paper asks: Which (if any) of these two
situations occur?

The answer to this question in the case $t = 2$ follows from the
results of Gao and Ge [9]. Firstly they give a probabilistic argument
(reproduced in Section 3 below) which shows that there are at least
$\kappa q^{2n/3}$ codewords in a 2-separable code for all sufficiently
large q, for some constant $\kappa$. Secondly, they combine a bound on
2-separable codes of length 2 due to Cheng, Ji and Miao [7] with an
observation that 2-separable codes can naturally be used to construct
shorter 2-separable codes over a larger alphabet, to prove the
following upper bound. Let C be a q-ary 2-separable code of length n.
Then

$$|C| \le \tfrac{3}{2} q^{2\lceil n/3\rceil} - \tfrac{1}{2} q^{\lceil n/3\rceil}.$$

So the answer to our question when $t = 2$ is ‘Neither situation
occurs’, as an optimal 2-separable code has about $q^{2n/3}$ codewords
for large q.

In fact, the upper bound of Gao and Ge can be tightened to improve
the exponent in the leading term (when 3 does not divide n). We show
(Theorem 3 below) that a 2-separable code C satisfies

$$|C| \le q^{\lceil (2/3)n\rceil} + \tfrac{1}{2} q^{\lfloor (1/3)n\rfloor}\bigl(q^{\lfloor (1/3)n\rfloor} - 1\bigr) \le (3/2)\, q^{\lceil (2/3)n\rceil}.$$

When $t \ge 3$, we will show that the answer to our question is
‘t-separable codes are close to being (t - 1)-frameproof codes’. This
follows from the upper bound (1) from (t - 1)-frameproof codes,
combined with Theorem 5 below, which says that there exist q-ary
t-separable codes of length n with at least $\kappa q^{n/(t-1)}$
codewords for all sufficiently large q, where $\kappa$ is a constant
depending only on n and t. Just like Gao and Ge’s argument when
$t = 2$, Theorem 5 is probabilistic, using random choice with
expurgation. However, the naive generalisation of Gao and Ge’s argument
will not give a good enough bound: some care is needed when defining
the set of ‘bad’ events that should be excluded.

We comment that Gao and Ge [9, Corollary 4.8] also provide a
probabilistic existence result for t-separable codes for $t \ge 3$.
Indeed, they show that a q-ary t-separable code of length n with M
codewords exists for some value of n satisfying


n < 


(q-iy 



However, this result is much weaker than Theorem 5 below for many
parameter sets. For example, when n and t are fixed and $q \to \infty$,
the result of Gao and Ge only guarantees the existence of a code with
approximately e («-«)/(*+l) codewords for some constant $\kappa$.

The structure of the remainder of the paper is as follows. In Section 2,
after briefly providing (Theorem 2) a proof of the upper bound (1) on
the number of codewords in a t-separable code, we state and prove our
improvement (Theorem 3) of Gao and Ge’s upper bound for $t = 2$. In
Section 3 we provide a recap of Gao and Ge’s probabilistic existence
result (Theorem 4) for 2-separable codes. We include the full proof of
this result for completeness, to establish notation, and to provide an
introduction to the more complicated proof in the general case. This
general case (Theorem 5) is contained in Section 4.


2 Upper bounds 

Theorem 2 below is due to Cheng, Ji and Miao [7, Lemma 3.7 and
Theorem 3.8]; their proof uses a bound of Blackburn [1, Theorem 1] on
the maximum cardinality of a (t - 1)-frameproof code. We include a
proof here for completeness. For simplicity, we use a slightly weaker
bound on the maximum cardinality of frameproof codes due to Staddon,
Stinson and Wei [12].


Theorem 2. Let q, n and t be positive integers such that $q \ge 2$,
$n \ge 2$ and $t \ge 2$. Let C be a q-ary t-separable code of length n.
Then $|C| \le (t-1)\, q^{\lceil n/(t-1)\rceil}$.

Proof. By Theorem 1, C is a (t - 1)-frameproof code.

For a subset $L \subseteq \{1, 2, \ldots, n\}$ of components, let $U_L$
be the set of all codewords that are uniquely determined by their
components in L. (So if $c \in U_L$ no other codeword agrees with c on
all positions in L.) Clearly, $|U_L| \le q^{|L|}$.

Let $L_1, L_2, \ldots, L_{t-1}$ be a partition of $\{1, 2, \ldots, n\}$
into t - 1 subsets of size at most $\lceil n/(t-1)\rceil$. We claim that
$C = \bigcup_{i=1}^{t-1} U_{L_i}$. Clearly this claim is enough to prove
the theorem, for then

$$|C| \le \sum_{i=1}^{t-1} |U_{L_i}| \le \sum_{i=1}^{t-1} q^{\lceil n/(t-1)\rceil} = (t-1)\, q^{\lceil n/(t-1)\rceil}.$$

Suppose for a contradiction that there exists
$c \in C \setminus \bigcup_{i=1}^{t-1} U_{L_i}$. Then for each
$i \in \{1, 2, \ldots, t-1\}$ there exists $x_i \in C \setminus \{c\}$
that agrees with c on all components in $L_i$. So, setting
$X = \{x_1, x_2, \ldots, x_{t-1}\}$ we see that
$c \in \mathrm{desc}(X)$. Since $c \in C \setminus X$, this contradicts
the fact that C is (t - 1)-frameproof. So the claim, and therefore the
theorem, follows. □
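The covering claim in this proof can be checked exhaustively for tiny parameters. The sketch below takes t = 3, n = 2 and q = 3, uses 0-based positions with the partition {0}, {1}, and runs over every non-empty ternary code of length 2; it assumes codewords are strings, and all helper names are hypothetical.

```python
from itertools import combinations, product

def desc(words):
    cols = [{w[k] for w in words} for k in range(len(words[0]))]
    return {"".join(p) for p in product(*cols)}

def is_frameproof(code, t):
    cs = set(code)
    return all(desc(X) & cs == set(X)
               for size in range(1, min(t, len(code)) + 1)
               for X in combinations(code, size))

def uniquely_determined(code, L):
    """U_L: codewords that no other codeword matches on all positions in L."""
    key = lambda w: tuple(w[k] for k in L)
    return {c for c in code if sum(key(w) == key(c) for w in code) == 1}

t, q, n = 3, 3, 2
parts = [(0,), (1,)]                       # partition into t - 1 = 2 parts
words = ["".join(w) for w in product("012", repeat=n)]
all_covered, within_bound = True, True
for m in range(1, len(words) + 1):
    for code in combinations(words, m):
        if is_frameproof(code, t - 1):
            cover = set().union(*(uniquely_determined(code, L) for L in parts))
            all_covered = all_covered and cover == set(code)
            # (t-1) * q^ceil(n/(t-1)) = 2 * 3 = 6
            within_bound = within_bound and len(code) <= (t - 1) * q
print(all_covered, within_bound)  # True True
```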

When $t = 2$ the bound of Theorem 2 is vacuous. However, Gao and Ge [9]
obtained a non-trivial bound in this case. We tighten their bound as
follows.


Theorem 3. Let q and n be integers such that $q \ge 2$ and $n \ge 2$. If
C is a q-ary 2-separable code of length n, then

$$|C| \le q^{\lceil (2/3)n\rceil} + \tfrac{1}{2} q^{\lfloor (1/3)n\rfloor}\bigl(q^{\lfloor (1/3)n\rfloor} - 1\bigr) \le (3/2)\, q^{\lceil (2/3)n\rceil}.$$


Proof. Suppose C is a code over the alphabet F, where $|F| = q$.

Define $r = \lceil (2/3)n\rceil$ and define $s = \lfloor (1/3)n\rfloor$.
Note that $r + s = n$.

For a word $x \in F^n$, define $\pi(x) \in F^r$ to be the prefix of x of
length r, and define $\sigma(x) \in F^s$ to be the suffix of x of
length s.

For $a \in F^r$, define $T_a \subseteq F^s$ to be the set of suffixes of
words in C beginning with a. So

$$T_a = \{\sigma(x) \mid x \in C \cap \pi^{-1}(a)\}.$$

Write $t_a = |T_a|$. Clearly

$$|C| = \sum_{a \in F^r} t_a$$

since C is a disjoint union of the codewords contributing to each of
the sets $T_a$.

We claim that for any distinct $a, a' \in F^r$ we have
$|T_a \cap T_{a'}| < 2$. To show this, we assume for a contradiction
that there exist distinct $b, b' \in T_a \cap T_{a'}$. Then the words
ab, ab', a'b and a'b' all lie in C. But then the sets X = {ab, a'b'} and
X' = {ab', a'b} have the same set of descendants, contradicting the
fact that C is 2-separable. So our claim follows.
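The key observation in the claim is that swapping suffixes between two prefixes leaves the descendant set unchanged. A small illustration, with hypothetical prefixes and suffixes chosen only for the example:

```python
from itertools import product

def desc(words):
    words = list(words)
    cols = [{w[k] for w in words} for k in range(len(words[0]))]
    return {"".join(p) for p in product(*cols)}

a, a2 = "01", "10"    # two distinct prefixes (illustrative values)
b, b2 = "2", "3"      # two distinct suffixes (illustrative values)

X = {a + b, a2 + b2}      # {ab, a'b'}
X2 = {a + b2, a2 + b}     # {ab', a'b}
print(X != X2, desc(X) == desc(X2))  # True True
```

Both sets have the same component sets in every position, so their descendant sets coincide even though the sets themselves differ.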

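Each set $T_a$ contains $\binom{t_a}{2}$ subsets of $F^s$ of size 2. The
claim above shows that no two subsets $T_a$ and $T_{a'}$ can contain the
same subset of size 2. Since $F^s$ contains $\binom{q^s}{2}$ subsets of
size 2,

$$\sum_{a \in F^r} \tfrac{1}{2} t_a (t_a - 1) \le \tfrac{1}{2} q^s (q^s - 1).$$

The maximum sum of non-negative integer variables $u_a$ such that

$$\sum_{a \in F^r} \tfrac{1}{2} u_a (u_a - 1) \le \tfrac{1}{2} q^s (q^s - 1)$$

is achieved when $u_a = 2$ for exactly $\tfrac{1}{2} q^s (q^s - 1)$
values of a, and $u_a = 1$ for the remaining values. Hence

$$\begin{aligned}
|C| = \sum_{a \in F^r} t_a &\le \sum_{a \in F^r} u_a \\
&\le 2\bigl(\tfrac{1}{2} q^s (q^s - 1)\bigr) + \bigl(q^r - \tfrac{1}{2} q^s (q^s - 1)\bigr) \\
&= q^r + \tfrac{1}{2} q^s (q^s - 1)
\end{aligned}$$

and so the theorem follows. □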


3 A lower bound for 2-separable codes 


This section is an exposition of a lower bound for the cardinality of a
2-separable code due to Gao and Ge [9].


Theorem 4. Let q and n be integers such that $q \ge 2$ and $n \ge 2$.
Then there exists a q-ary 2-separable code of length n with M
codewords, where

$$M \ge \max_{N \in \mathbb{N}} \Bigl\{ N - \binom{N}{2} q^{-n} - 3\binom{N}{4} 2^n q^{-2n} \Bigr\}.$$

In particular, for each value of n there exists a constant $\kappa$
(depending only on n) with the following property: For all sufficiently
large integers q, there exists a q-ary 2-separable code of length n with
at least $\kappa q^{(2/3)n}$ codewords.


Proof. The theorem follows by a straightforward application of the
technique of random choice with expurgation. The details are as
follows. Let N be an integer to be chosen later. Choose words
$c_1, c_2, \ldots, c_N \in F^n$ uniformly and independently at random.

For $i, j \in \{1, 2, \ldots, N\}$ with $i < j$, let $E_{i,j}$ be the
event that $c_i = c_j$. So each event $E_{i,j}$ occurs with probability
$q^{-n}$. Let $R_{i,j}$ be the indicator random variable taking the
value 1 when $E_{i,j}$ occurs, otherwise taking the value 0. The
expected value of $R_{i,j}$ is $q^{-n}$.

For distinct $i, j, i', j' \in \{1, 2, \ldots, N\}$ with $i < j$, with
$i < i'$ and with $i' < j'$, set $X = \{c_i, c_j\}$ and
$X' = \{c_{i'}, c_{j'}\}$. Let $F_{i,j,i',j'}$ be the event that
desc(X) = desc(X'). This happens only if X(k) = X'(k) for all
$k \in \{1, 2, \ldots, n\}$. Now, X(k) = X'(k) with probability at most
$2/q^2$: once the values of the kth components of $c_{i'}$ and $c_{j'}$
are chosen, the kth component of $c_i$ must agree with one of the kth
components of $c_{i'}$ and $c_{j'}$ (this event occurs with probability
at most 2/q), and then there is a unique value that the kth component
of $c_j$ must take so that X(k) = X'(k) (and the kth component of $c_j$
takes this value with probability 1/q). So $F_{i,j,i',j'}$ occurs with
probability at most $(2/q^2)^n$. Let $S_{i,j,i',j'}$ be the indicator
random variable taking the value 1 when $F_{i,j,i',j'}$ occurs,
otherwise taking the value 0. The expected value of $S_{i,j,i',j'}$ is
at most $2^n q^{-2n}$. Note that there are $3\binom{N}{4}$ choices for
the quadruple (i, j, i', j'). To see this, first note that there are
$\binom{N}{4}$ choices for the subset $L = \{i, j, i', j'\}$. Once L is
fixed, i is determined as the smallest element of L. There are then 3
choices for $j \in L \setminus \{i\}$. Finally, i' and j' are determined
as (respectively) the smallest and largest elements from
$L \setminus \{i, j\}$.
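Both counting facts used in this paragraph can be confirmed by direct enumeration for small parameters. The following sketch computes the exact probability that two random pairs of symbols have the same underlying set, and counts the admissible index quadruples; the function names are illustrative assumptions.

```python
from fractions import Fraction
from itertools import permutations, product
from math import comb

def prob_same_component_set(q):
    """Exact P({a, b} == {a2, b2}) for independent uniform a, b, a2, b2 in F, |F| = q."""
    F = range(q)
    hits = sum(1 for a, b, a2, b2 in product(F, repeat=4) if {a, b} == {a2, b2})
    return Fraction(hits, q ** 4)

def count_quadruples(N):
    """Number of distinct (i, j, i2, j2) in {0..N-1} with i < j and i < i2 < j2."""
    return sum(1 for i, j, i2, j2 in permutations(range(N), 4)
               if i < j and i < i2 < j2)

for q in (2, 3, 4, 5):
    p = prob_same_component_set(q)
    print(q, p, p <= Fraction(2, q * q))

print(count_quadruples(6), 3 * comb(6, 4))  # 45 45
```

The exact per-component probability works out to (2q - 1)/q^3, which is slightly below the bound 2/q^2 used in the proof.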

Let $Z = \sum_{i,j} R_{i,j} + \sum_{i,j,i',j'} S_{i,j,i',j'}$. So Z is a
random variable which counts the number of the events $E_{i,j}$ and
$F_{i,j,i',j'}$ that occur. By linearity of expectation, the expected
value of Z is at most

$$\sum_{i,j} q^{-n} + \sum_{i,j,i',j'} 2^n q^{-2n} = \binom{N}{2} q^{-n} + 3\binom{N}{4} 2^n q^{-2n}.$$

In particular, there is a choice of words $c_i$ such that at most b of
the events $E_{i,j}$ and $F_{i,j,i',j'}$ occur where

$$b = \binom{N}{2} q^{-n} + 3\binom{N}{4} 2^n q^{-2n}.$$

We fix such a choice of words.

We form a set $I \subseteq \{1, 2, \ldots, N\}$ by adding i to I
whenever $E_{i,j}$ or $F_{i,j,i',j'}$ occurs. So $|I| \le b$. We define
$C = \{c_i \mid i \in \{1, 2, \ldots, N\} \setminus I\}$. By our
construction of I and by definition of the event $E_{i,j}$, all the
codewords in C are distinct (we have removed one of each pair
$\{c_i, c_j\}$ with $c_i = c_j$). Thus $|C| \ge N - b$.

We claim that C is a 2-separable code. Let $X, X' \subseteq C$ be
distinct non-empty subsets of codewords with $|X| \le 2$ and
$|X'| \le 2$. If $|X| = |X'| = 2$, then
$\mathrm{desc}(X) \ne \mathrm{desc}(X')$, since by our construction of I
and the definition of the events $F_{i,j,i',j'}$ we have removed one of
the four codewords involved when we constructed C from
$\{c_1, c_2, \ldots, c_N\}$. (If X and X' share a codeword, say
X = {c, x} and X' = {c, x'}, then desc(X) = desc(X') forces x = x'
component by component, so the four codewords may be taken to be
distinct.) So, without loss of generality, we may assume $|X'| = 1$,
which implies that $|\mathrm{desc}(X')| = 1$. But then
desc(X) = desc(X') implies that X = X' in this situation, contradicting
the fact that X and X' are distinct. So C is 2-separable. Thus the
first statement of the theorem follows.

To prove the last statement of the theorem, we fix n and choose
$N = \lfloor 2^{-(1/3)n} q^{(2/3)n} \rfloor$. We observe that

$$\begin{aligned}
|C| &\ge N - \binom{N}{2} q^{-n} - 3\binom{N}{4} 2^n q^{-2n} \\
&\ge N - N^2 q^{-n} - \tfrac{1}{8} N^4 2^n q^{-2n} \\
&\ge \bigl(2^{-(1/3)n} - 2^{-(2/3)n} q^{-(1/3)n} - \tfrac{1}{8} 2^{-(1/3)n} - o(1)\bigr)\, q^{(2/3)n} \\
&> \tfrac{1}{2}\, 2^{-(1/3)n} q^{(2/3)n}
\end{aligned}$$

when q is sufficiently large. So the final statement of the theorem
follows with $\kappa = \tfrac{1}{2}\, 2^{-(1/3)n}$. We remark that we
have not tried to optimise the constant $\kappa$ here: see Gao and
Ge [9] for a tight value. □
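The random-choice-with-expurgation argument translates into a short randomised procedure. The sketch below is an illustration under stated assumptions, not the construction of Gao and Ge verbatim: words are tuples over {0, ..., q-1}, the generator is Python's `random.Random` with a fixed seed, and for every bad event the smallest index involved is deleted, as in the definition of the set I.

```python
import random
from itertools import combinations, product

def desc(words):
    cols = [{w[k] for w in words} for k in range(len(words[0]))]
    return frozenset(product(*cols))

def random_2_separable(q, n, N, seed=0):
    """Sample N random words, then delete one word for each bad event."""
    rng = random.Random(seed)
    words = [tuple(rng.randrange(q) for _ in range(n)) for _ in range(N)]
    removed = set()
    # Events E_{i,j}: c_i = c_j with i < j; remove index i.
    for i, j in combinations(range(N), 2):
        if words[i] == words[j]:
            removed.add(i)
    # Events F_{i,j,i',j'}: two disjoint pairs with equal descendant sets.
    by_desc = {}
    for i, j in combinations(range(N), 2):
        by_desc.setdefault(desc((words[i], words[j])), []).append((i, j))
    for pairs in by_desc.values():
        for (i, j), (i2, j2) in combinations(pairs, 2):
            if len({i, j, i2, j2}) == 4:
                removed.add(min(i, i2))   # smallest of the four indices
    return [words[i] for i in range(N) if i not in removed]

def is_2_separable(code):
    """Brute-force check of Definition 2 with t = 2 on non-empty subsets."""
    subsets = [(w,) for w in code] + list(combinations(code, 2))
    ds = [desc(X) for X in subsets]
    return len(ds) == len(set(ds))

code = random_2_separable(q=5, n=3, N=12, seed=1)
print(len(code), is_2_separable(code))
```

The number of surviving words depends on the seed, but the surviving set is always 2-separable, since every repeated word and every colliding pair of disjoint pairs loses at least one member.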


4 A lower bound for t-separable codes when t ≥ 3

Theorem 5. Let n and t be fixed integers such that $n \ge 2$ and
$t \ge 3$. There exists a positive constant $\kappa$, depending only on
n and t, so that there is a q-ary t-separable code of length n with at
least $\kappa q^{n/(t-1)}$ codewords for all sufficiently large
integers q.

The proof of Theorem 5 will use random choice with expurgation, just as
in the proof of Theorem 4 above. However, we have to work harder to
provide a good enough bound on the number of randomly chosen words we
need to remove to form our code C: the straightforward approach of
Theorem 4 is no longer sufficient. Here is a brief summary of the
method that is used.

As before, we choose words $c_1, c_2, \ldots, c_N \in F^n$ uniformly and
independently at random. We want to identify (and ultimately remove
elements from) subsets $X, X' \subseteq \{c_1, c_2, \ldots, c_N\}$ such
that $|X'| \le |X| \le t$ and desc(X) = desc(X'). We fix non-negative
integers r, s and s' and restrict ourselves to the cases when
$|X \cap X'| = r$, $|X \setminus X'| = s$ and $|X' \setminus X| = s'$.
For most choices of r, s and s' (choices such that (r, s, s') lies in a
certain set J defined below) the approach of Theorem 4 will work: we
define a collection of appropriate events $F_{A,B,B'}$, where
$A, B, B' \subseteq \{1, 2, \ldots, N\}$ are disjoint subsets of indices
of sizes r, s and s' respectively. However, we cannot do this when
$r = t - 1$ and $s = s' = 1$: the probability of the events
$F_{A,B,B'}$ is too large and will affect the leading term. So we need
to work harder to deal with this case. Let j and j' be defined by
$X \setminus X' = \{c_j\}$ and $X' \setminus X = \{c_{j'}\}$. It turns
out that the probability that desc(X) = desc(X') is dominated by
situations when $c_j$ and $c_{j'}$ agree in many positions. So, rather
than have events $E_{j,j'}$ corresponding to equality of $c_j$ and
$c_{j'}$ as in Theorem 4, we define $E_{j,j'}$ to be the event that
$c_j$ and $c_{j'}$ agree in many positions. We then define events
$G_{A,j,j'}$ that require desc(X) = desc(X') and also require that
$E_{j,j'}$ does not occur. The remainder of the proof proceeds as in
Theorem 4, although various calculations become a little more
complicated.

Proof of Theorem 5. Let q be a fixed positive integer, and let F be a
set of cardinality q. Let N be an integer to be chosen later. Choose
words $c_1, c_2, \ldots, c_N \in F^n$ uniformly and independently at
random.

For any $i, j \in \{1, 2, \ldots, N\}$ with $i < j$, let $E_{i,j}$ be the
event that $c_i$ and $c_j$ agree in $n/(t-1)$ or more components. There
are fewer than $2^n$ ways of choosing a set K of $n/(t-1)$ or more
components; the probability that the codewords $c_i$ and $c_j$ agree on
the components in K is at most $q^{-n/(t-1)}$. So the event $E_{i,j}$
occurs with probability at most $2^n q^{-n/(t-1)}$. Let $R_{i,j}$ be the
indicator random variable for $E_{i,j}$. The expected value of $R_{i,j}$
is at most $2^n q^{-n/(t-1)}$.
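The union bound on the probability of $E_{i,j}$ can be compared with the exact value, which is a binomial tail: each component of two independent uniform words agrees with probability 1/q. The sketch below is illustrative; to keep the arithmetic rational it compares the (t-1)-th powers of the two sides, and the function names are assumptions.

```python
from fractions import Fraction
from math import comb

def prob_agree_at_least(q, n, m):
    """Exact P(two independent uniform words of F^n agree in >= m positions)."""
    p = Fraction(1, q)
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(m, n + 1))

def bound_holds(q, n, t):
    m = -(-n // (t - 1))                  # least integer >= n/(t-1)
    exact = prob_agree_at_least(q, n, m)
    # exact <= 2^n q^(-n/(t-1))  <=>  exact^(t-1) <= 2^(n(t-1)) q^(-n)
    return exact ** (t - 1) <= Fraction(2 ** (n * (t - 1)), q ** n)

print(all(bound_holds(q, n, t)
          for q in (2, 3, 7) for n in (2, 3, 5) for t in (3, 4)))  # True
```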

Let A, B and B' be disjoint subsets of $\{1, 2, \ldots, N\}$ such that
$|B'| \le |B|$. Define the event $F_{A,B,B'}$ as follows. Let
$X = \{c_i \mid i \in A \cup B\}$ and let
$X' = \{c_i \mid i \in A \cup B'\}$. Then $F_{A,B,B'}$ is the event that
desc(X) = desc(X'). Since we have

$$\mathrm{desc}(\{c_i : i \in B\}) \subseteq \mathrm{desc}(X) \subseteq \mathrm{desc}(X')$$

when our event occurs, we see that for $i \in B$ the kth component of
$c_i$ must agree with the kth component of one of the elements of X'.
So the probability of the event $F_{A,B,B'}$ is bounded above by
$((|A| + |B'|)/q)^{|B| n}$. Let $S_{A,B,B'}$ be the indicator random
variable corresponding to $F_{A,B,B'}$. Then the expected value of
$S_{A,B,B'}$ is at most $(|A| + |B|)^{|B| n} q^{-|B| n}$.

Define J to be the set of triples (r, s, s') of integers such that:

$0 \le r \le t - 1$,

$1 \le s \le t - r$,

$0 \le s' \le s$,

$2 \le r + s$, and
$(r, s, s') \ne (t - 1, 1, 1)$.

We will only be interested in the events $F_{A,B,B'}$ where
$(|A|, |B|, |B'|) \in J$.

We now define events associated with the element (t - 1, 1, 1) we have
removed from J. Let A, {j} and {j'} be disjoint subsets of
$\{1, 2, \ldots, N\}$ of cardinalities t - 1, 1 and 1 respectively.
Define the event $G_{A,j,j'}$ as follows. Let
$X = \{c_i \mid i \in A \cup \{j\}\}$ and let
$X' = \{c_i \mid i \in A \cup \{j'\}\}$. Then $G_{A,j,j'}$ is the event
that $c_j$ and $c_{j'}$ agree in fewer than $n/(t-1)$ components and
desc(X) = desc(X'). There are fewer than $2^n$ ways of choosing a set K
of components of size less than $n/(t-1)$, and $G_{A,j,j'}$ is the union
of the sub-events where $c_j$ and $c_{j'}$ agree exactly in the
components in K. Define

$$Y = \{c_i \mid i \in A\}.$$

We have X(k) is the union of Y(k) and the kth component of $c_j$.
Similarly, X'(k) is the union of Y(k) and the kth component of
$c_{j'}$. The condition desc(X) = desc(X') is equivalent to the
condition that X(k) = X'(k) for all components k. So on the components
$k \notin K$ where $c_j$ and $c_{j'}$ differ, the kth components of
$c_j$ and $c_{j'}$ must both lie in Y(k); this happens with probability
at most $((t-1)/q)^2$. When $k \in K$, the codewords $c_j$ and $c_{j'}$
agree in their kth components, and this happens with probability 1/q.
When q is sufficiently large, $((t-1)/q)^2 < 1/q$, and so the
probability of the event $G_{A,j,j'}$ is at most

$$2^n \bigl(((t-1)/q)^2\bigr)^{n - n/(t-1)} (1/q)^{n/(t-1)} \le (2t)^{2n} q^{-(2 - 1/(t-1))n}.$$

The above expression is an upper bound on the expectation of the
indicator random variable $T_{A,j,j'}$ corresponding to $G_{A,j,j'}$.
Define $\mathcal{R} = \{(i, j) : 1 \le i < j \le N\}$, so
$|\mathcal{R}| \le N^2$. For integers $(r, s, s') \in J$, define
$\mathcal{S}_{r,s,s'}$ to be the set of triples (A, B, B') of disjoint
subsets of $\{1, 2, \ldots, N\}$ with |A| = r, |B| = s and |B'| = s', so
$|\mathcal{S}_{r,s,s'}| \le N^{r+s+s'}$. Finally define $\mathcal{T}$ to
be the set of all triples (A, j, j') where A, {j} and {j'} are disjoint
subsets of $\{1, 2, \ldots, N\}$ and $|A| = t - 1$. We see that
$|\mathcal{T}| \le N^{t+1}$.

We are interested in excluding the events $E_{i,j}$ where
$(i, j) \in \mathcal{R}$, the events $F_{A,B,B'}$ where
$(A, B, B') \in \mathcal{S}_{r,s,s'}$ with $(r, s, s') \in J$, and the
events $G_{A,j,j'}$ where $(A, j, j') \in \mathcal{T}$. So we define a
random variable Z by

$$Z = \sum_{(i,j) \in \mathcal{R}} R_{i,j} + \sum_{(r,s,s') \in J}\ \sum_{(A,B,B') \in \mathcal{S}_{r,s,s'}} S_{A,B,B'} + \sum_{(A,j,j') \in \mathcal{T}} T_{A,j,j'}.$$

We have computed upper bounds on the expectation of the indicator
variables in this expression, and on the number of terms in each sum.
From these bounds, and from linearity of expectation, we can deduce
that Z has expected value at most b, where we define

$$\begin{aligned}
b_1 &= N^2\, 2^n q^{-(1/(t-1))n}, \\
b_2 &= \sum_{(r,s,s') \in J} N^{r+s+s'} (r+s)^{sn} q^{-sn}, \\
b_3 &= N^{t+1} (2t)^{2n} q^{-(2 - 1/(t-1))n}, \text{ and} \\
b &= b_1 + b_2 + b_3. \qquad (2)
\end{aligned}$$

We now proceed as in Theorem 4. There is a choice of the values $c_i$
such that at most b of the events occur. Fix this choice. Define a
subset $I \subseteq \{1, 2, \ldots, N\}$ by adding i for each event
$E_{i,j}$ that occurs, adding a single element
$i \in A \cup B \cup B'$ for each event $F_{A,B,B'}$ that occurs and
adding a single element $i \in A \cup \{j\} \cup \{j'\}$ for each event
$G_{A,j,j'}$ that occurs. We then define
$C = \{c_i \mid i \in \{1, 2, \ldots, N\} \setminus I\}$.

If $c_j = c_{j'}$ for some $j \ne j'$ then the event $E_{j,j'}$ holds, so
one of j and j' lies in I, thus $c_j$ or $c_{j'}$ was removed when C was
formed. So all the codewords in C are distinct. In particular,
$|C| \ge N - b$.

We now show that C is a t-separable code. Suppose for a contradiction
that there are distinct non-empty subsets $X, X' \subseteq C$ with
$|X| \le t$, $|X'| \le t$ and desc(X) = desc(X'). Without loss of
generality, we may assume that $|X'| \le |X|$. Define $Y = X \cap X'$.
Let $r = |Y|$, $s = |X \setminus Y|$ and $s' = |X' \setminus Y|$. If
|X| = 1 then |X'| = 1 and we have a contradiction since all codewords
are distinct. So we may assume that $|X| = r + s \ge 2$. We now see that
$(r, s, s') \in J \cup \{(t - 1, 1, 1)\}$. If $(r, s, s') \in J$, then
the event $F_{A,B,B'}$ occurs, where A, B and B' are the indices of the
codewords in Y, $X \setminus Y$ and $X' \setminus Y$ respectively. But
then at least one element of $A \cup B \cup B'$ lies in I, and so we
have a contradiction in this case. Suppose now that
(r, s, s') = (t - 1, 1, 1). Then $X = Y \cup \{c_j\}$ and
$X' = Y \cup \{c_{j'}\}$ for distinct $j, j' \in \{1, 2, \ldots, N\}$.
Without loss of generality, we may assume that $j < j'$. If $c_j$ and
$c_{j'}$ agree in $n/(t-1)$ or more components, then the event
$E_{j,j'}$ occurs and so $j \in I$. Thus $c_j \notin C$, and we have a
contradiction. Now suppose that $c_j$ and $c_{j'}$ agree in fewer than
$n/(t-1)$ components. Then the event $G_{A,j,j'}$ occurs, where A is the
set of indices corresponding to Y. But at least one of the indices
$A \cup \{j\} \cup \{j'\}$ lies in I, which gives us a contradiction in
this final case. Hence C is a t-separable code, as required.

Finally, we need to show that N can always be chosen so that
$|C| \ge \kappa q^{n/(t-1)}$ for all sufficiently large q. We set
$N = \lfloor \epsilon q^{n/(t-1)} \rfloor$, where $\epsilon$ is a
sufficiently small positive constant depending only on n and t. We will
always take $\epsilon < 1$. Note that
$|C| \ge N - b = \epsilon q^{n/(t-1)} - b - O(1)$. We will now show that
when $N = \lfloor \epsilon q^{n/(t-1)} \rfloor$,

$$b \le \epsilon^2 \kappa' q^{n/(t-1)} \qquad (3)$$

for some constant $\kappa'$ which depends only on n and t. Proving this
inequality establishes the theorem. For if we choose
$\epsilon < 1/\kappa'$ we see that $\epsilon - \epsilon^2 \kappa'$ is
positive and so

$$|C| \ge N - b \ge (\epsilon - \epsilon^2 \kappa')\, q^{n/(t-1)} - O(1) \ge \kappa q^{n/(t-1)}$$

for all sufficiently large q, where $\kappa$ is any positive constant
less than $\epsilon - \epsilon^2 \kappa'$.

Recall that $b = b_1 + b_2 + b_3$. We establish (3) by bounding each of
$b_1$, $b_2$ and $b_3$ in turn. First,

$$b_1 = N^2\, 2^n q^{-n/(t-1)} \le \epsilon^2\, 2^n q^{n/(t-1)} = \epsilon^2 \alpha_1 q^{n/(t-1)}$$

where $\alpha_1$ depends only on n.

We now turn to providing a bound for $b_2$. The constant $b_2$ is
defined as a sum over $(r, s, s') \in J$. We begin by bounding a term in
this sum.

Suppose that $(r, s, s') \in J$. We have that $r + s \le t$ and
$1 \le s$. Moreover, $r + s + s' \ge 2$ and so

$$\begin{aligned}
N^{r+s+s'} (r+s)^{sn} q^{-sn} &\le \epsilon^2 (r+s)^{sn} q^{(r+s+s'-(t-1)s)n/(t-1)} \\
&\le \epsilon^2 t^{tn} q^{(r+s'-(t-2)s)n/(t-1)}.
\end{aligned}$$
We claim that $r + s' - (t-2)s \le 1$ for all $(r, s, s') \in J$. For if
$s \ge 2$, then

$$r + s' - (t-2)s \le r + s - 2(t-2) \le t - 2(t-2) = -t + 4 \le 1,$$

since $t \ge 3$. If s = 1 and $r < t - 1$, then

$$r + s' - (t-2)s \le (t-2) + s - (t-2) = 1.$$

Finally, if s = 1 and r = t - 1, then s' = 0 and so

$$r + s' - (t-2)s = t - 1 - (t-2) = 1.$$


So our claim follows, and

$$N^{r+s+s'} (r+s)^{sn} q^{-sn} \le \epsilon^2 t^{tn} q^{n/(t-1)}$$

whenever $(r, s, s') \in J$. Since $|J| \le (t+1)^3$, we find that

$$b_2 = \sum_{(r,s,s') \in J} N^{r+s+s'} (r+s)^{sn} q^{-sn} \le \epsilon^2 (t+1)^3 t^{tn} q^{n/(t-1)} = \epsilon^2 \alpha_2 q^{n/(t-1)},$$

where $\alpha_2$ is a constant depending only on n and t.
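The case analysis behind the bound on $b_2$ can be confirmed by enumerating J for small t. The sketch below lists the triples (r, s, s') satisfying the defining conditions of J, checks that the exponent r + s' - (t - 2)s never exceeds 1 on J, and shows that the excluded triple (t - 1, 1, 1) would give exponent 2, which is why it needs separate treatment. The function names are illustrative.

```python
def triples_J(t):
    """All (r, s, s2) with 0<=r<=t-1, 1<=s<=t-r, 0<=s2<=s, r+s>=2, minus (t-1,1,1)."""
    return [(r, s, s2)
            for r in range(0, t)
            for s in range(1, t - r + 1)
            for s2 in range(0, s + 1)
            if r + s >= 2 and (r, s, s2) != (t - 1, 1, 1)]

def exponent(t, r, s, s2):
    """The quantity r + s' - (t - 2)s appearing in the exponent of q."""
    return r + s2 - (t - 2) * s

for t in range(3, 9):
    J = triples_J(t)
    worst = max(exponent(t, *x) for x in J)
    excluded = exponent(t, t - 1, 1, 1)
    print(t, len(J) <= (t + 1) ** 3, worst, excluded)
```

For each t from 3 to 8 the last two columns printed are 1 and 2: the maximum over J is attained at (t - 1, 1, 0), and the excluded triple exceeds it by one.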

Finally, we provide a bound for $b_3$. We can see that

$$b_3 = N^{t+1} (2t)^{2n} q^{-(2 - 1/(t-1))n} \le \epsilon^2 (2t)^{2n} q^{((t+1) - 2(t-1) + 1)(n/(t-1))}.$$

Now $(t + 1) - 2(t - 1) + 1 = 4 - t \le 1$ since $t \ge 3$. Hence

$$b_3 \le \epsilon^2 \alpha_3 q^{n/(t-1)}$$

for some constant $\alpha_3$ depending only on n and t.

The bounds above combine with (2) to show that (3) holds, where
$\kappa' = \alpha_1 + \alpha_2 + \alpha_3$. So the theorem follows. □